Learning Continuous Depth Representation via Geometric Spatial Aggregator

نویسندگان

چکیده

Depth map super-resolution (DSR) has been a fundamental task for 3D computer vision. While arbitrary scale DSR is more realistic setting in this scenario, previous approaches predominantly suffer from the issue of inefficient real-numbered upsampling. To explicitly address issue, we propose novel continuous depth representation DSR. The heart our proposed Geometric Spatial Aggregator (GSA), which exploits distance field modulated by arbitrarily upsampled target gridding, through geometric information introduced into feature aggregation and generation. Furthermore, bricking with GSA, present transformer-style backbone named GeoDSR, possesses principled way to construct functional mapping between local coordinates high-resolution output results, empowering model advantage shape transformation ready help diverse zooming demand. Extensive experimental results on standard benchmarks, e.g., NYU v2, have demonstrated that framework achieves significant restoration gain compared prior art. Our codes are available at https://github.com/nana01219/GeoDSR.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Hierarchical Feature Representation in Depth Image

This paper presents a novel descriptor, geodesic invariant feature (GIF), for representing objects in depth images. Especially in the context of parts classification of articulated objects, it is capable of encoding the invariance of local structures effectively and efficiently. The contributions of this paper lie in our multi-level feature extraction hierarchy. (1) Low-level feature encodes th...

متن کامل

Spatial Representation: Discrete vs. Continuous Computational Models

Topological notions and methods are used in various areas of the physical sciences and engineering, and therefore computer processing of topological data is important. Separate from this, but closely related, are computer science uses of topology: applications to programming language semantics and computing with exact real numbers are important examples. The seminar concentrated on an important...

متن کامل

Video Representation via a Continuous Probabilistic Framework

In this work we describe a novel statistical video representation and modeling scheme. Video representation schemes are needed to enable segmenting a video stream into meaningful video-objects, useful for later indexing and retrieval applications. In the proposed methodology, unsupervised clustering via Guassian mixture modeling extracts coherent space-time regions in feature space, and corresp...

متن کامل

Learning Via Compact Data Representation

We present an unsupervised learning methodology derived from compact data encoding and demonstrate how to construct models of polysemy, priming, semantic disambiguation and learning using this theoretical basis. The model is capable of simulating human-like performance on artificial grammar learning.

متن کامل

Depth Based Image Registration via 3D Geometric Segmentation

Image registration is a fundamental task in computer vision and it significantly contributes to high-level computer vision and benefits numerous practical applications. Although many image registration techniques have been proposed in the past, there is still a need for further research because many issues such as the parallax problem remain to be solved. The traditional image registration algo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i3.25369